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A method, apparatus, and article of manufacture for a computer implemented 
hypertext classifier. A new document containing citations to and from other 
documents is classified. Initially, documents within a neighborhood of the new 
document are identified. For each document and each class, an initial 
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Detailed Description Text - DETX (71) : 

A similar problem appears in inductive logic programming, as discussed in 
"Lavrac and Dzeroski". Suppose a relational table, or several tables, are 
given containing information about patients with hypertension. Apart from 
local attributes of a person, such as age and weight, there can be other 
attributes, such as father and mother, whose records are also in the database. 
Assuming there is .a relation between lineage and hypertension, the latter 
fields are used for inducing rules to classify patients as having high or low 
risk of heart attack. To do this, each record is augmented with fields from 
records of the parents. Rather than using raw data from the related records, 
the raw attribute values can be pre-processed into a synthesized feature. One 
example is whether the parent was classified as high or low risk. This is 
called feature engineering. 



Detailed Description Text - DETX (147) : 

6.1 Bridges "Co-citation" is a well -studied phenomenon in implicitly linked 
corpora, such as academic papers, as discussed in G. Salton and M. J. McGill, 
"Introduction to Modern Information Retrieval", McGraw-Hill, 1983, which is 
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incorporated by reference herein. Documents that cite or are cited by many 
common documents may be regarded as similar, much as documents sharing many 
terms are adjudged similar. Citation-based similarity, or a combination of 
citation and term-based similarity, can then be used to perform unsupervised 
clustering of documents, discussed in R. Weiss, B. Velez, M. A. Sheldon, C. 
Nemprempre, P. Szilagyi, A. Duda, and D. K. Gifford, "HyPursuit: A Hierarchical 
Network search Engine that Exploits Content-link Hypertext Clustering", Proc. 
of the Seventh ACM Conference on Hypertext, Washington, D.C., March 1996, which 
is incorporated by reference herein. 
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